ESOM Visualizations for Quality Assessment in Clustering

نویسندگان

  • Alfred Ultsch
  • Martin Behnisch
  • Jörn Lötsch
چکیده

Classical clustering algorithms as well as intrinsic evaluation criteria impose predefined structures onto a data set. If the structures do not fit the data, the clustering will fail and the evaluation criteria will lead to erroneous conclusions. Recently, the abstract U-matrix has been defined for emergent self-organizing maps (ESOM). In this work the abstract forms of the Pand the U* are defined in analogy to the Pand the U*-matrix on ESOM. The abstract U*-matrix can be used for AU*-clustering of data by taking account of density and distance structures. For AU*-clustering the structures seen on the ESOM serve as a supervising quality measure. In this way it can be determined whether an AU*-clustering represents important structures inherent to the high dimensional data. Importantly, AU*-clustering does not impose a geometric cluster shape, which may not fit the underlying data structure, onto the data set. The approach is demonstrated on benchmark data as well as real world data from spatial science.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine-learned cluster identification in high-dimensional data

BACKGROUND High-dimensional biomedical data are frequently clustered to identify subgroup structures pointing at distinct disease subtypes. It is crucial that the used cluster algorithm works correctly. However, by imposing a predefined shape on the clusters, classical algorithms occasionally suggest a cluster structure in homogenously distributed data or assign data points to incorrect cluster...

متن کامل

ESOM: An Algorithm to Evolve Self-Organizing Maps from On-Line Data Streams

An algorithm of evolving self-organizing map (ESOM) is proposed as a dynamic version of the Kohonen self-organizing map, where network structure is evolved in an on-line adaptive mode. Experiments have been carried out on some benchmark data sets as well as on macroeconomic data. Results show that ESOM is a good tool for clustering, data analysis, and visualisation.

متن کامل

Clustering with Swarm Algorithms Compared to Emergent SOM

Swarm Based clustering (SBC) is a promising nature-inspired technique. A swarm of stochastic agents performs the task of clustering high-dimensional data on a low-dimensional output space. Most SBC methods are derivatives of the Ant Colony Clustering (ACC) approach proposed by Lumer and Faieta. Compared to clustering on Emergent Self-Organizing Maps (ESOM) these methods usually perform poorly i...

متن کامل

A Self-Organizing Map with Expanding Force for Data Clustering and Visualization

The Self-Organizing Map (SOM) is a powerful tool in the exploratory phase of data mining. However, due to the dimensional conflict, the neighborhood preservation cannot always lead to perfect topology preservation. In this paper, we establish an Expanding SOM (ESOM) to detect and preserve better topology correspondence between the two spaces. Our experiment results demonstrate that the ESOM con...

متن کامل

ESOM-Maps: tools for clustering, visualization, and classification with Emergent SOM

An overview on the usage of emergent self organizing maps is given. U-Maps visualize the distance structures of high dimensional data sets. P-Maps show their density structures and U*-Maps combine the advantages of the mentioned maps to a visualization suitable to detect nontrivial cluster structures. A concise summary on the usage of Emergent Self-organizing Maps (ESOM) for data mining is give...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016